Unsupervised gene selection using biological knowledge : application in sample clustering
نویسندگان
چکیده
منابع مشابه
Unsupervised Gene Selection and Clustering Using Simulated Annealing
When applied to genomic data, many popular unsupervised explorative data analysis tools based on clustering algorithms often fail due to their small cardinality and high dimensionality. In this paper we propose a wrapper method for gene selection based on simulated annealing and unsupervised clustering. The proposed approach, even if computationally intensive, permits to select the most relevan...
متن کاملUnsupervised sample reduction using clustering for intrusion detection system
Analysis of network traffics, financial transactions, and mobile communications are examples of applications where examining all samples of a large dataset is computationally expensive, and requires significant memory space. A common approach to address this challenge is to reduce the number of samples without compromising the accuracy of the analysis. In this paper, we propose a new cluster-ba...
متن کاملBiological Data Mining for Genomic Clustering Using Unsupervised Neural Learning
The paper aims at designing a scheme for automatic identification of a species from its genome sequence. A set of 64 three-tuple keywords is first generated using the four types of bases: A, T, C and G. These keywords are searched on N randomly sampled genome sequences, each of a given length (10,000 elements) and the frequency count for each of the 4 = 64 keywords is performed to obtain a DNA-...
متن کاملUnsupervised Image Segmentation Using a Hierarchical Clustering Selection Process
In this paper we present an unsupervised algorithm to select the most adequate grouping of regions in an image using a hierarchical clustering scheme. Then, we introduce an optimisation approach for the whole process. The grouping method presented is based on the maximisation of a measure that represents the perceptual decision. The whole strategy takes profit from a hierarchical clustering to ...
متن کاملApplication of Clustering for Unsupervised Language Learning
We describe a method for automatically learning word similarity from a corpus. We constructed feature vectors for words according to their appearance in different dependency paths in parse trees of corpus sentences. Clustering the huge amount of raw data costs too much time and memory, so we devised techniques to make the problem tractable. We used PCA to reduce the dimensionality of the featur...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Bioinformatics
سال: 2017
ISSN: 1471-2105
DOI: 10.1186/s12859-017-1933-0